Cheap Newton Steps for Optimal Control Problems: Automatic Diierentiation and Pantoja's Algorithm
نویسنده
چکیده
In this paper we discuss Pantoja's construction of the Newton direction for discrete time optimal control problems. We show that Automatic Diierentiation techniques can be used to calculate the Newton direction accurately, without requiring extensive rewriting of user code, and at a surprisingly low computational cost: for an N-step problem with p control variables and q state variables at each step, the worst case cost is 6(p + q + 1) times the computational cost of a single target function evaluation, independent o f N, together with at most p 3 =3 + p 2 (q + 1) + 2 p(q + 1) 2 + (q + 1) 3 , ie less than (p+q+ 1) 3 , oating point m ultiply-and-add operations per timestep. These costs may be considerably reduced if there is signiicant structural sparsity in the problem dynamics. The systematic use of checkpointing roughly doubles the operation counts, but reduces the total space cost to the order of 4pN oating point stores. A n a i v e approach to nding the Newton step would require the solution of an N p N p system of equations together with a number of function evaluations proportional to N p , so this approach to Pantoja's construction is extremely attractive, especially if q is very small relative t o N. Straightforward modiications of the AD algorithms proposed here can be used to implement other discrete time optimal control solution techiniques, such as diierential dynamic programming (DDP), which use state-control feedback. The same techniques also can be used to determine with certainty, at the cost of a single Newton direction calculation, whether or not the Hessian of the target function is suuciently positive deenite at a point o f i n terest. This allows computationally cheap post-hoc veriication that a second-order minimum has been reached to a given accuracy, regardless of what method has been used to obtain it.
منابع مشابه
Some Eecient Algorithms for Unconstrained Discrete-time Optimal Control Problems
The diierential dynamic programming algorithm (DDP) and the stage-wise Newton procedure are two typical examples of eecient local procedures for discrete-time optimal control (DTOC) problems. It is desirable to generalize these local procedures to globally convergent methods. One successful globalization was recently proposed by Coleman and Liao 3] which combines the trust region idea with Pant...
متن کاملFad and Spg for Optimal Control
Automatic diierentiation and nonmonotone spectral projected gradient techniques are used for solving optimal control problems. The original problem is reduced to a nonlinear programming one using general Runge-Kutta integration formulas. Canonical formulas which use a fast automatic diierentiation strategy are given to compute derivatives of the objective function. On the basis of this approach...
متن کاملTraining Large Neural Networks
We describe regularization tools for training large-scale artiicial feed-forward neural networks. We propose algorithms that explicitly use a sequence of Tikhonov regularized nonlinear least squares problems. For large-scale problems, methods using new special purpose automatic diierentiation are used in a conjugate gradient method for computing a truncated Gauss-Newton search direction. The al...
متن کاملخوشهبندی خودکار دادهها با بهرهگیری از الگوریتم رقابت استعماری بهبودیافته
Imperialist Competitive Algorithm (ICA) is considered as a prime meta-heuristic algorithm to find the general optimal solution in optimization problems. This paper presents a use of ICA for automatic clustering of huge unlabeled data sets. By using proper structure for each of the chromosomes and the ICA, at run time, the suggested method (ACICA) finds the optimum number of clusters while optim...
متن کاملOptimization with the time-dependent Navier-Stokes equations as constraints
In this paper, optimal distributed control of the time-dependent Navier-Stokes equations is considered. The control problem involves the minimization of a measure of the distance between the velocity field and a given target velocity field. A mixed numerical method involving a quasi-Newton algorithm, a novel calculation of the gradients and an inhomogeneous Navier-Stokes solver, to find the opt...
متن کامل